Normal Colon Ife^nnple ^aj^g^vy^ixt^fi^ 

bo *-* 

Genes that are expressed in normal colon, that are not expressed at detectalg£ g ppj 
levels in colon adenocarcinoma 

Below is a listing of those genes that are expressed at appreciable levels xnz> 
normal colon, but that do not appear to be expressed in colon adenocarcinoma. 
There are 333 sequences presented in the listing below. 

>gi| 1472311 |gb|AA011199.1|AA011199 ze23c03.sl Soares_f etal_heart_NbHH19W Homo 

sapiens cDNA clone 

IMAGE: 359812 3', mRNA sequence 

TTTTTTTGCTTTTCCCCCTCTATCTCAAATGCCTCACTCCGGGACAAATATTCCTTCTTAGCACTACTTA 
ACATAAATAAATCAATTTTGGCTCTTCAAAGTCATTCTGCATATAATCCTGTCACCCTTATAGGAGCCTA 
CTGTACATTCTCATTGTGAACCAATCTCTCTTTACATTCACAATATTATGATGCNTTCACNGGGAGGGGC 
TCTTCCCCAAGGTAAATGTAGCTCATTTAANAAACTACAGTGGGGCTCTGTGATCTATGTACACATGTAT 
GTCACATATTTGACCATTTAAAAAACACAGAAACCCTTGAGACTTTCTGTAAAATTTTGGGATCATAAAA 
TATATTAAAAAGCAGACCAAAGGCAAGGCATTCTGGTTCTCTGACGTCCCCCGGTCTAGTTTAATTCATT 
TTCCCAGTATTGGGGAAAAGCAGGGCA 

>gi|1479353|gb|AA016979.1|AA016979 ze41h01.sl Soares retina 

N2b4HR Homo sapiens cDNA clone IMAGE: 361585 3' similar to gb | M10329 | MUSUR48S 
Mouse 4.8S U6 small 

nuclear (rRNA) ; contains Alu repetitive element;, mRNA sequence 

AAATATGGAACGCTTCACGAATTTGCGTGTCATCCTTGCGCAGGGGCCATGCTAATCTTCTCTGTATCGT 

TCCAATTTTAGTATATGTGCTGCCGAAGCGAGCACCGTGCTTAGTTATTCTAAGTGAGGGCCCCAGGATC 

CACCTGCCTAGGCTTCCCAAAGTGCTGGGATTACAGGCGTGACCCACCGCGCCCAGCCAAGTTTTGGTTT 

CCTCAACTGGAGGTAATATTACATATTTTACTTATACATATGCATAAGTAAACAAAGAGGGTTGTTTTGA 

GGGTCAAATAAATTGATGGATGTTAACGCTCTNCTGGTAAATTATAAAGCACTATACAAATACAAGGCAT 

TATTGTTAATAATAGAGCTTAATTACACCTGTCCTCATTTGATCTCTCANAGACC 

>gi|1493220|gb|AA027011.1|AA027011 zk02c08.sl Soares_pregnant_uterus_NbHPU Homo 
sapiens cDNA clone 
IMAGE:469358 3', mRNA sequence 

GTTTTAAAACATTTCTTTATTAGTATATAGACAGTAAAGCATGAAATAGATACAAACATTACTTATAAAA 
ATGTTTTGAAAGAACATTTGAAAAATAGATGAATGTCTTCTAGCCAGTTAATAGCAGAGAAAGAATTTAG 
TTTTGGTAGCTCATAAGTCAGTAACCGTATGCCATGTCTCCAGAAGTAAAATCCGTCTGTTTTCCAGAAA 
AATGTGATGTAGNGAATTNTCATTTTATGTGTTATTTTGCACTCATTAATGTAAATTTTAGATTTAAAAA 
AATCAAGTTTATTTGCTTTCTAAGAAAATGGNCTCCTTNCCCATTCGCCAGTAGNTTAATATATGTTCTA 
CGGTGTGGGTGTGT 

>gi 1 1506906 | gb | AA034962 . 1 1 AA034962 zk25h03 . si Soares_pregnant_uterus_NbHPU 
Homo sapiens cDNA clone IMAGE:471605 3', mRNA sequence 

TTTTTTTTTTGCAAGAAACACATGGGGATGGTTTTATTTTTATAATTGGTTAAAAAAGTTTCTCAAAGTG 
AAGTTTAGGAGTGAGTTTTAGTGTTTTGTAATTTGAAGAAAAGTGTCTTAAAATTCAAAGATACCAATGG 
AAAGATAAAAGTTTGGGG 



>gi| 15124 87 | gb | AA037388.1 |AA037388 zc03e01.sl 

Soares_parathyroid_tumor_NbHPA Homo sapiens cDNA clone IMAGE: 321240 3', mRNA 
sequence 

GATTTTTCAAAATTAACTTTTTTATTAATTTAAAAATCCAGAAATACAGTGACTACATAAATAAGTACCA 
TAATTAGGTACATGTCCTGTGAGAACAGTGAAAGGGTAATACTGTTATGTTACTCTTACTTGTTTACATG 
AGTTAACTAGAAAATGGCTACAACTGCTAAATGATGCTTATGGTCTTTGTTGTTCCAAGTGTTTATGATA 
CAAATAAATACACAAGAAGAACCACATCCATTCTTCTCTACTAACTACAGGCAGCTTGGGG 



<110> Smith, John; .-Smlthgcno Inc. 

<120> Example of a Sequence Listing 

<130> 01-jOOOOI 

<\C0> PCT/EP98/00001 

<ltl> 1998-12-31 ', v / 

<150> US 08/999. 999 :5 ^ 

<l*i> 1997-10-15 

<1C0> < \S 

<170> Patcntln version 2.0 



<210> 1 

< 2 1 1 > 389 

< 2 1 2 > l)NA s 

< 2 1 3 > Paramecium sp. 



<220> 

<22 1> COS 

<222> (279) . . . (J89) 

<300> 

<301> Doc. Richard 

<)02> Isolation and Cha ractcr i ia t i on of a Ccnc Encoding a 

Protease from Paramecium sp 

<303> Journal of Ccncs 

<)04> 1 

<30S> 4 

O0C> 1-7 

<)07> 1988-06-31 

O00> 1234SC 

O09> 1988-0C-31 

<400> 1 . 

agctgiagtc attcctgtgi" ccccttctct c tgggc i cci caccctgcta atcagatctc 



agggagagtg tcttgaccct- cctctgcctt tgeagcttea caggcaggoa ggcaggcagc 



120 



cgaigtggca attgetggea gcgccacagg ctccLcagcc aggcttaggg cgggttccgc 100 
cgcggcgcgg cggcccctct cgcgctcctc tcgcgcctci ctcccgccct cccctcgctc 240 



3 
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Appendix 3. page 2 . 

«.cct C .ct .«t„ 5 «, 9 w«ew c.yct.gc yet cc* £9 tec 

1 .5 

tto tct etc a*a too «t gga ttt tgt ttg tct * gtt tgt ttg ttc 

tel ttr Phc Lys ,-tS rro cly The Cys. Leu The Val Cys Leu The 

tot ccc «a gtc etc ccc tgt cac tea tea ctg cag ccg . aat ctt 

Cys Pro Lys ,Val Leu Pro Cys His Scr Set -Leu . CI n Pro Asru.-b«u. 

23 30 \ 35 



<210> 2 

<211> 37 

<212> PRT 

<213> Paramecium sp. X 

Mct^Val Sc? Met Phc Scr Leu Scr Phc Lys Vrp Pro Cly Phc, Cys 

Plic Val Cys Leu Phc Cln Cys no Lys Va 1 Leu Pro Cys Mis Scr 
20 2* 30 

Leu Cln Pro Asn Lou 

<210> ) 

< 2 1 1 > 11 
<212> PKT 

< 2 1 3 > Artificial Sequence 
<220> 

<223 > Designed peptide hosed on size and polarity to act a 
linker between the alpha and beta chains of Protein XYZ. 

<400> ) 

net Val Asn Leu Clu Pro Met Mis Thr Clu lie 

1 S 10 



1 



<210> < 
<<00> < 
000 



(Annex VIII follows! 



cue 101 j^jwi ny 

•pquence 



iuein-AtActA- alia tuext ^^umH<"<y a "V aiilv 

cable. The numeric ide«ficr shall be used only xn 'Jj^. .„ chc 

Listing." The order an^rcscntation of the items 01 x m*. 
••Sequence Listing" shall conform to the "" n 9«^ n < wit^tJe 
item of information shall begin on a n w line and shall J^gin with X he. . 
numeric identifier enclosed in, angle brackets as shown The submission 
of those items of information designated with an M " roa " da ^ or y-- TnC 
submission of those items of information designated with an O is 
optxonal. Numeric identifiers <110> through <170> shall only be set • 
?orth at the beginning of the.-Sequenc Listing." T]>e following table 
illustrates th numeric identifiers, % 



Numeric 
Identifier 

<110> 



<120> 



<130> 



<\<0> 



< 1 4 1 > 



<150> 



< 1 5 1 > 



<1G0> 



<170> 



<210> 



<21 1> 



Definition 



Applicant 



Title of 
Invention 

File Reference 



Comments and 
Format 

Preferably max. 
of 10 names; 
one name per line; 
preferable ..format: 
Surname, Others 
Names and/or 
Initials 



Personal file 
reference 



Mandatory (M) or 
Optional SO) 

M v ' 



M 



Current Applica- Specify as: 
ti.pn Number US 07/999, 999 or 

PCT/US96/99999 



M^whcn filed prior 
to assignment of 
appl . number 

M, if available 



Current Filing Specify as: yyyy-mm-dd M. if available 
Date 



Prior Application Specify as: 
Number US 07/999, 999 or 

PCT/US9G/99999 



M, if applicable 
include priority 
documents under 
35 USC 119 and 
120 



Prior Application Specify as: yyyy-mm-dd M, if applicable 
Filing Date 

Number of SEQ ID Count includes 



M 



NOs 



Software 



SEQ ID NO: « 



Length 



total number of 
SEQ ID NOs 

Name of software used 0 
to create the 
Sequence Listing 

Response shall be an M 
integer repre- 
senting the SEQ 
ID NO shown 

Respond with an integer M 
expressing the number 
of bases or amino acid 
res idues 



xno/W i I'M' 



<2 1 2> 



Type 



<213> 



Organism 



: * Whether presented 
sequence mole- 
cule is ON A, 
BNA, or PRT 
(protein) . I f 
^ a nucleotide 
sequence con- 
tains both DNA 
and ANA frag- 
ments, the 
type shall be 
M DNA." In ad- 
" dition, the 
combined DNA/ 
KNA molecule 
shall be further 
described in 
the <220> to 
<223> feature 
section. /* 

Scientific name, 
i.e. Genus/species/ 
Unknown or Artifi- 
cial Sequence. In 
addition, the 
"Unknown" or 
"Artificial Se- 
quence" organisms 
shall be further 
described in the 
<220> to <223> 
feature section. 



M 



M 



<220> 



feature 



Leave blank after 
<220>. <221-223> 
provide for a 
description of 
points of bio- 
logical signi- 
ficance in the 
sequence. 



M, under the 
following condi- 
tions : i f "n, " 
"Xaa, " or a mod- 
i f i cd or unusua 1 
L-amino acid or 
modified base was 
used in a se- 
quence; i f ORGAN - 
ISM is "Artifi- 
cial Sequence" or 
"Unknown"; if 
molecule is 
combined DNA/RNA. 



<221> 



Name/ Kc y 



Provide appropriate 
ident i f icr for 
feature, pre- 
ferably from 
WIPO Standard 
ST. 2 5 (1998), 
Appendix 2, 
Tables 5 and 6 



M, under the fol- 
lowing conditions 
i f "n, " "Xaa, " or 
a mod i f i cd or un- 
usual L-amino 
acid or modi f icd 
base was used in 
a sequence 



<222> 



Loca t i on 



Spcci f y location 
within sequence; 
where appropriate 
state numbc r of 
first and last 
hascs/amino acids 



M # under the fol- 
lowing conditions 
i f "n, " "Xaa. " or 
a mod i f i cd or un- 
usual L-amino 
acid or mod i f i cd 



<223> 



Other Infor- 
mation 



in feature 



Other relevant 

information; 

four lines maximum 



Lc was u 5{fcJ 
Iqucncc 

M, under the fol- 
lowing conditions : 
f f "n, M "Xaa, " 'or 
a modified or un- 
usual L-amino acid 
or modified base 
was used in a 
sequence; if 
ORGANISM 
is -Artificial 
Sequence" or 
"Unknown"; .i-f-=* ~ 
molecule is com- 
bined DNA/flNA^r* r 



<300> 



<301> 



Publication 
Information 

Authors 



Leave blank* 
after <300> 

Preferably max 
of ten named 
authors of publi- 
cation; specify 
one name per line; 
preferable format: 
Surname, Other 
Names and/or 
Initials 



0 



<302> 
<303> 
<304 > 
<30S> 
<30G> 
<307> 



< 300> 



<309> 



<310> 



Title 
Jou rna 1 
Vol umc 
Issue 
Pages 
Date 



0 



Database 
Acccs s i on 
Numbc r 



Database Entry 
Da tc 



Patent Document 
Number 



Journal date on which 
data publ i shed; 
specify as yyyy-nvn- 
dd, MMM-yyyy or 
Sea son- yyyy 

Accession number 
assigned by data- 
base including 
database name 

Date of entry in 
database; specify 
as yyyy-mm-dd or 
MMM-yyyy 

Document number; 
for patent-type 
citations only. 
Spec i f y as, for 
example, US 
07/999. 999 



* 



< 3 2 ) > 



<312> 



Patent TiJ 
Date 



Document filing 
dace, for patent- 
type citations only; 
specify as yyyy-mm-dd 



Publication Date 'Document publication 
date, for 
patent-type 
citations only; 
"specify as yyyy-mm-dd; 



<313> 



<400> 



Relevant 
Residues 

Sequence 



FROM (position) TO 
(posi tion) 

SCO ID NO should 
follow the 
numeric identifier 
and should appear 
on the line pre- 
ceding the actual* 
sequence 



5. Section 1.02< is revised to read as follov/s; 



"M 



V-' 



1.024 Form and. format for nucleotide and/or amino acid sequence 
submissions in computer readable form. 



(a) The computer readable form required by 
following specifications: 



1.021 (c) shal 1 meet the 



(J) The computer readable form shall contain a s i ng 1 c "Sequence Listing' 
as cither a diskette, series of diskettes, or other permissible media 
outlined in paragraph (c? of this section. 

(?) The "Sequence Listing 0 ." in paragraph (a) (1) of this section shall be 
submitted in American Standard Code for Information Interchange (ASCII) 
text. No other formats shall be allowed. 

(3) The computer readable form may be created by any means, such as word 
processors, nucleotide/amino acid sequence editors or other custom 
computer programs; however, it shall conform to all specifications 
detailed in this section. 



(<) File compression is acceptable when using diskette media, so long as 
the compressed file is in a sc 1 f -c>: t ract i ng format that will decompress 
on one of the systems described in paragraph (b) of this section. 



(S) Page numbering shall not appear within 
version of the "Sequence Listing" file. 



he computer readable form 



(61 All computer readable forms shall have a label permanently affixed 
thereto on which has been hand-printed or typed: the name of the 
applicant, the title of the invention, the date on which the data were 
recorded on the computer readable form, the operating system used, a 
reference number, and an application serial number and filing date, if 
l;nown. 

(b) Computer readable form submissions must meet these format 
rcqui remcnts : 

(11 Computer: I DM TC/XT/AT. or compatibles, or Apple Macintosh; 
(2» Operating System: MS-DOS. Unix or Macintosh; 



